Algorithms in Bioinformatics

chapter

Measures of Codon Bias in Yeast, the tRNA Pairing Index and Possible DNA Repair Mechanisms

Markus T. Friberg, Pedro Gonnet, Yves Barral, Nicol N. Schraudolph, et al.

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 1-11

Protein translation is a rapid and accurate process, which has been optimized by evolution. Recently, it has been shown that tRNA reusage influences translation speed. We present the tRNA Pairing Index (TPI), a novel index to measure the degree of tRNA reusage in any gene. We describe two variants of the index, how to combine various such indices into a single one and an efficient algorithm for their...

chapter

Decomposing Metabolomic Isotope Patterns

Sebastian Böcker, Matthias C. Letzel, Zsuzsanna Lipták, Anton Pervukhin

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 12-23

We present a method for determining the sum formula of metabolites solely from their mass and isotope pattern. Metabolites, such as sugars or lipids, participate in almost all cellular processes, but the majority still remains uncharacterized. Our input is a measured isotope pattern from a high resolution mass spectrometer, and we want to find those molecules that best match this pattern. Determination...

chapter

A Method to Design Standard HMMs with Desired Length Distribution for Biological Sequence Analysis

Hongmei Zhu, Jiaxin Wang, Zehong Yang, Yixu Song

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 24-31

Motivation: Hidden Markov Models (HMMs) have been widely used for biological sequence analysis. When modeling a phenomenon where, for instance, the nucleotide distribution does not change for various lengths of DNA, there are two popular approaches to achieve a desired length distribution: explicit or implicit modeling. The implicit modeling requires an elaborately designed model structure...

chapter

Efficient Model-Based Clustering for LC-MS Data

Marta Łuksza, Bogusław Kluge, Jerzy Ostrowski, Jakub Karczmarski, et al.

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 32-43

Proteomic mass spectrometry is gaining an increasing role in diagnostics and in studies on protein complexes and biological systems. The issue of high-throughput data processing is therefore becoming more and more significant. Problems arise from imperfect data, noise, and various errors introduced during experiments. In this paper we focus on the peak alignment problem...

chapter

A Bayesian Algorithm for Reconstructing Two-Component Signaling Networks

Lukas Burger, Erik Nimwegen

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 44-55

We present an algorithm, based on a Bayesian network model, for ab initio prediction of signaling interactions in bacterial two-component systems. The algorithm uses a large training set of known interacting kinase/receiver pairs to build a probabilistic model of dependency between the amino acid sequences of the two proteins and uses this model to predict which pairs interact. We show that the algorithm...

chapter

Linear-Time Haplotype Inference on Pedigrees Without Recombinations

M. Y. Chan, Wun-Tat Chan, Francis Y. L. Chin, Stanley P. Y. Fung, et al.

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 56-67

In this paper, a linear-time algorithm, which is optimal, is presented to solve the haplotype inference problem for pedigree data when there are no recombinations and the pedigree has no mating loops. The approach is based on the use of graphs to capture SNP, Mendelian and parity constraints of the given pedigree.

chapter

Phylogenetic Network Inferences Through Efficient Haplotyping

Yinglei Song, Chunmei Liu, Russell L. Malmberg, Liming Cai

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 68-79

The genotype phasing problem is to determine the haplotypes of diploid individuals from their genotypes where linkage relationships are not known. Based on the model of perfect phylogeny, the genotype phasing problem can be solved in linear time. However, recombinations may occur and the perfect phylogeny model thus cannot interpret genotype data with recombinations. This paper develops a graph theoretical...

chapter

Beaches of Islands of Tractability: Algorithms for Parsimony and Minimum Perfect Phylogeny Haplotyping Problems

Leo Iersel, Judith Keijsper, Steven Kelk, Leen Stougie

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 80-91

The problem Parsimony Haplotyping (PH) asks for the smallest set of haplotypes which can explain a given set of genotypes, and the problem Minimum Perfect Phylogeny Haplotyping (MPPH) asks for the smallest such set which also allows the haplotypes to be embedded in a perfect phylogeny evolutionary tree, a well-known biologically-motivated data structure. For PH we extend recent work of [17] by further...

chapter

On the Complexity of SNP Block Partitioning Under the Perfect Phylogeny Model

Jens Gramm, Tzvika Hartman, Till Nierhoff, Roded Sharan, et al.

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 92-102

Recent technologies for typing single nucleotide polymorphisms (SNPs) across a population are producing genome-wide genotype data for tens of thousands of SNP sites. The emergence of such large data sets underscores the importance of algorithms for large-scale haplotyping. Common haplotyping approaches first partition the SNPs into blocks of high linkage-disequilibrium, and then infer haplotypes for...

chapter

How Many Transcripts Does It Take to Reconstruct the Splice Graph?

Paul Jenkins, Rune Lyngsø, Jotun Hein

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 103-114

Alternative splicing has emerged as an important biological process which increases the number of transcripts obtainable from a gene. Given a sample of transcripts, the alternative splicing graph (ASG) can be constructed—a mathematical object minimally explaining these transcripts. Most research has so far been devoted to the reconstruction of ASGs from a sample of transcripts, but little has been...

chapter

Multiple Structure Alignment and Consensus Identification for Proteins

Jieping Ye, Ivaylo Ilinkin, Ravi Janardan, Adam Isom

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 115-125

An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus structure which captures common substructures present in the given proteins. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins. A distinguishing...

chapter

Procrastination Leads to Efficient Filtration for Local Multiple Alignment

Aaron E. Darling, Todd J. Treangen, Louxin Zhang, Carla Kuiken, et al.

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 126-137

We describe an efficient local multiple alignment filtration heuristic for identification of conserved regions in one or more DNA sequences. The method incorporates several novel ideas: (1) palindromic spaced seed patterns to match both DNA strands simultaneously, (2) seed extension (chaining) in order of decreasing multiplicity, and (3) procrastination when low multiplicity matches are encountered...

chapter

Controlling Size When Aligning Multiple Genomic Sequences with Duplications

Minmei Hou, Piotr Berman, Louxin Zhang, Webb Miller

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 138-149

For a genomic region containing a tandem gene cluster, a proper set of alignments needs to align only orthologous segments, i.e., those separated by a speciation event. Otherwise, methods for finding regions under evolutionary selection will not perform properly. Conversely, the alignments should indicate every orthologous pair of genes or genomic segments. Attaining this goal in practice requires...

chapter

Reducing Distortion in Phylogenetic Networks

Daniel H. Huson, Mike A. Steel, Jim Whitfield

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 150-161

When multiple genes are used in a phylogenetic study, the result is often a collection of incompatible trees. Phylogenetic networks and super-networks can be employed to analyze and visualize the incompatible signals in such a data set. In many situations, it is important to have control over the amount of incompatibility that is represented in a phylogenetic network, for example reducing noise by...

chapter

Imputing Supertrees and Supernetworks from Quartets

Barbara Holland, Glenn Conner, Katharina T. Huber, Vincent Moulton

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 162-162

A contemporary and sometimes contentious problem in genome phylogeny is to reconcile the fact that an accurately reconstructed gene tree does not necessarily correspond to a species phylogeny. Thus, in practice, species phylogenies are commonly obtained by applying consensus tree/supertree methods to collections of gene trees. However, such methods can suppress true conflicts in gene trees arising...

chapter

A Unifying View of Genome Rearrangements

Anne Bergeron, Julia Mixtacki, Jens Stoye

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 163-173

Genome rearrangements have been modeled by a variety of operations such as inversions, translocations, fissions, fusions, transpositions and block interchanges. The double cut and join operation, introduced by Yancopoulos et al., can model all the classical operations while simplifying the algorithms. In this paper we show a simple way to apply this operation to the most general type of genomes...

chapter

Efficient Sampling of Transpositions and Inverted Transpositions for Bayesian MCMC

István Miklós, Timothy Brooks Paige, Péter Ligeti

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 174-185

The evolutionary distance between two organisms can be determined by comparing the order of appearance of orthologous genes in their genomes. Besides the numerous parsimony approaches that try to obtain the shortest sequence of rearrangement operations sorting one genome into the other, Bayesian Markov chain Monte Carlo methods were introduced a few years ago. The computational time for convergence...

chapter

Alignment with Non-overlapping Inversions in O(n³)-Time

Augusto F. Vellozo, Carlos E. R. Alves, Alair Pereira Lago

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 186-196

Alignments of sequences are widely used for biological sequence comparisons. Usually only biological events such as mutations, insertions and deletions are modeled, and other events such as inversions are not automatically detected by the usual alignment algorithms. Alignment with inversions has no known polynomial-time algorithm, and a simplification of the problem that considers only...

chapter

Accelerating Motif Discovery: Motif Matching on Parallel Hardware

Geir Kjetil Sandve, Magnar Nedland, Øyvind Bø Syrstad, Lars Andreas Eidsheim, et al.

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 197-206

Discovery of motifs in biological sequences is an important problem, and several computational methods have been developed to date. One of the main limitations of the established motif discovery methods is that the running time is prohibitive for very large data sets, such as upstream regions of large sets of cell-cycle regulated genes. Parallel versions have been developed for some of these methods,...

chapter

Segmenting Motifs in Protein-Protein Interface Surfaces

Jeff M. Phillips, Johannes Rudolph, Pankaj K. Agarwal

Lecture Notes in Computer Science > Algorithms in Bioinformatics > 207-218

Protein-protein interactions form the basis for many intercellular events. In this paper we develop a tool for understanding the structure of these interactions. Specifically, we define a method for identifying a set of structural motifs on protein-protein interface surfaces. These motifs are secondary structures, akin to α-helices and β-sheets in protein structure; they describe how multiple residues...

Algorithms in Bioinformatics
6th International Workshop, WABI 2006, Zurich, Switzerland, September 11-13, 2006. Proceedings
